GENERAL INFORMATION: 

(i) APPLICANTS: Goodearl, Andrew; Stroobant, Paul;
Minghetti, Luisa; Waterfield, Michael; Marchioni, Mark;
Chen, Maio Su; Hiles, Ian

(ii) TITLE OF INVENTION: Glial Mitogenic Factors, Their 
Preparation and Use

(iii) NUMBER OF SEQUENCES: 184

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: Felfe & Lynch

(B) STREET: 305 Third Avenue

(C) CITY: New York City

(D) STATE: New York

(E) COUNTRY: USA

(F) ZIP: 10022

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Diskette, 5.25 inch, 360 kb storage

(B) COMPUTER: IBM

(C) OPERATING SYSTEM: PC-DOS

(D) SOFTWARE: Wordperfect

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION:

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: 08/036,555

(B) FILING DATE: 24-MAR-1993

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/965,173 

(B) FILING DATE: 23-OCT-1992

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/940,389 

(B) FILING DATE: 03-SEP-1992

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/907,138 
(B) FILING DATE: 30-JUN-1992 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/863,703

(B) FILING DATE: 03-APRIL-1992

(vii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: U.K. 91 07566.3 

(B) FILING DATE: 10-APRIL-1991 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Tsai, Christine H. 

(B) REGISTRATION NUMBER: 34,266 

(C) REFERENCE/DOCKET NUMBER: LUD 5250.17

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (212) 688-9200 

(B) TELEFAX: (212) 838-3884 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



Phe Lys Gly Asp Ala His Thr Glu
1 5



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 2:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13

(B) TYPE: amino acid
(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine;
Xaa in position 12 is unknown.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Xaa Lys 
1 5 10

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 3:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine; 

Xaa in position 10 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Xaa Thr Glu Thr Ser Ser Ser Gly Leu Xaa Leu Lys
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 
(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Xaa Lys Leu Gly Glu Met Trp Ala Glu
1 5



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7

(B) TYPE: amino acid

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Xaa Leu Gly Glu Lys Arg Ala 
1 5

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 6:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Xaa Ile Lys Ser Glu His Ala Gly Leu Ser Ile Gly Asp Thr Ala Lys
1 5 10 15 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Xaa Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Arg Lys
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 16

(B) TYPE: amino acid

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Xaa Ile Lys Gly Glu His Pro Gly Leu Ser Ile Gly Asp Val Ala Lys
1 5 10 15

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 9:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine
and Xaa in position 12 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Xaa Met Ser Glu Tyr Ala Phe Phe Val Gln Thr Xaa Arg
1 5 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Xaa Ser Glu His Pro Gly Leu Ser Ile Gly Asp Thr Ala Lys
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 11:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine; 

Xaa in position 8 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Xaa Ala Gly Tyr Phe Ala Glu Xaa Ala Arg
1 5 10

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 12:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine;

Xaa in position 7 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Xaa Lys Leu Glu Phe Leu Xaa Ala Lys 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 

(B) TYPE: amino acid 
(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Xaa Thr Thr Glu Met Ala Ser Glu Gln Gly Ala
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 14:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Xaa Ala Lys Glu Ala Leu Ala Ala Leu Lys
1 5 10

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 15:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Xaa Phe Val Leu Gln Ala Lys Lys
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Xaa Leu Gly Glu Met Trp
1 5



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 17:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Glu Tyr Lys Cys Leu Lys Phe Lys Trp Phe Lys Lys Ala Thr Val Met
1 5 10 15

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 18:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 8 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Glu Ala Lys Tyr Phe Ser Lys Xaa Asp Ala
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 2 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



Glu Xaa Lys Phe Tyr Val Pro 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Glu Leu Ser Phe Ala Ser Val Arg Leu Pro Gly Cys Pro Pro Gly Val 
1 5 10 15 



Asp Pro Met Val Ser Phe Pro Val Ala Leu 
20 25

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 21:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2003 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N in positions 31 and 32 could be either A or G.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GGAATTCCTT TTTTTTTTTT TTTTTTTCTT NNTTTTTTTT TGCCCTTATA CCTCTTCGCC 60 
TTTCTGTGGT TCCATCCACT TCTTCCCCCT CCTCCTCCCA TAAACAACTC TCCTACCCCT 120 
GCACCCCCAA TAAATAAATA AAAGGAGGAG GGCAAGGGGG GAGGAGGAGG AGTGGTGCTG 180 
CGAGGGGAAG GAAAAGGGAG GCAGCGCGAG AAGAGCCGGG CAGAGTCCGA ACCGACAGCC 240 
AGAAGCCCGC ACGCACCTCG CACC ATG AGA TGG CGA CGC GCC CCG CGC CGC 291 

Met Arg Trp Arg Arg Ala Pro Arg Arg 
1 5 

TCC GGG CGT CCC GGC CCC CGG GCC CAG CGC CCC GGC TCC GCC GCC CGC 339 
Ser Gly Arg Pro Gly Pro Arg Ala Gln Arg Pro Gly Ser Ala Ala Arg
10 15 20 25

TCG TCG CCG CCG CTG CCG CTG CTG CCA CTA CTG CTG CTG CTG GGG ACC 387
Ser Ser Pro Pro Leu Pro Leu Leu Pro Leu Leu Leu Leu Leu Gly Thr
30 35 40

GCG GCC CTG GCG CCG GGG GCG GCG GCC GGC AAC GAG GCG GCT CCC GCG 435
Ala Ala Leu Ala Pro Gly Ala Ala Ala Gly Asn Glu Ala Ala Pro Ala
45 50 55

GGG GCC TCG GTG TGC TAC TCG TCC CCG CCC AGC GTG GGA TCG GTG CAG 483
Gly Ala Ser Val Cys Tyr Ser Ser Pro Pro Ser Val Gly Ser Val Gln
60 65 70

GAG CTA GCT CAG CGC GCC GCG GTG GTG ATC GAG GGA AAG GTG CAC CCG 531
Glu Leu Ala Gln Arg Ala Ala Val Val Ile Glu Gly Lys Val His Pro
75 80 85

CAG CGG CGG CAG CAG GGG GCA CTC GAC AGG AAG GCG GCG GCG GCG GCG 579
Gln Arg Arg Gln Gln Gly Ala Leu Asp Arg Lys Ala Ala Ala Ala Ala
90 95 100 105 

GGC GAG GCA GGG GCG TGG GGC GGC GAT CGC GAG CCG CCA GCC GCG GGC 627 
Gly Glu Ala Gly Ala Trp Gly Gly Asp Arg Glu Pro Pro Ala Ala Gly 

110 115 120 

CCA CGG GCG CTG GGG CCG CCC GCC GAG GAG CCG CTG CTC GCC GCC AAC 675 
Pro Arg Ala Leu Gly Pro Pro Ala Glu Glu Pro Leu Leu Ala Ala Asn 
125 130 135

GGG ACC GTG CCC TCT TGG CCC ACC GCC CCG GTG CCC AGC GCC GGC GAG 723
Gly Thr Val Pro Ser Trp Pro Thr Ala Pro Val Pro Ser Ala Gly Glu
140 145 150

CCC GGG GAG GAG GCG CCC TAT CTG GTG AAG GTG CAC CAG GTG TGG GCG 771
Pro Gly Glu Glu Ala Pro Tyr Leu Val Lys Val His Gln Val Trp Ala
155 160 165 

GTG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG CTC ACC GTG CGC CTG 819 
Val Lys Ala Gly Gly Leu Lys Lys Asp Ser Leu Leu Thr Val Arg Leu 
170 175 180 185 

GGG ACC TGG GGC CAC CCC GCC TTC CCC TCC TGC GGG AGG CTC AAG GAG 867 
Gly Thr Trp Gly His Pro Ala Phe Pro Ser Cys Gly Arg Leu Lys Glu 

190 195 200 

GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAC GCC AAC AGC ACC AGC 915 
Asp Ser Arg Tyr Ile Phe Phe Met Glu Pro Asp Ala Asn Ser Thr Ser
205 210 215 

CGC GCG CCG GCC GCC TTC CGA GCC TCT TTC CCC CCT CTG GAG ACG GGC 963 

Arg Ala Pro Ala Ala Phe Arg Ala Ser Phe Pro Pro Leu Glu Thr Gly 
220 225 230

CGG AAC CTC AAG AAG GAG GTC AGC CGG GTG CTG TGC AAG CGG TGC GCC 1011
Arg Asn Leu Lys Lys Glu Val Ser Arg Val Leu Cys Lys Arg Cys Ala
235 240 245

TTG CCT CCC CAA TTG AAA GAG ATG AAA AGC CAG GAA TCG GCT GCA GGT 1059
Leu Pro Pro Gln Leu Lys Glu Met Lys Ser Gln Glu Ser Ala Ala Gly
250 255 260 265

TCC AAA CTA GTC CTT CGG TGT GAA ACC AGT TCT GAA TAC TCC TCT CTC 1107
Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu
270 275 280

AGA TTC AAG TGG TTC AAG AAT GGG AAT GAA TTG AAT CGA AAA AAC AAA 1155
Arg Phe Lys Trp Phe Lys Asn Gly Asn Glu Leu Asn Arg Lys Asn Lys
285 290 295

CCA CAA AAT ATC AAG ATA CAA AAA AAG CCA GGG AAG TCA GAA CTT CGC 1203
Pro Gln Asn Ile Lys Ile Gln Lys Lys Pro Gly Lys Ser Glu Leu Arg
300 305 310

ATT AAC AAA GCA TCA CTG GCT GAT TCT GGA GAG TAT ATG TGC AAA GTG 1251
Ile Asn Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val
315 320 325

ATC AGC AAA TTA GGA AAT GAC AGT GCC TCT GCC AAT ATC ACC ATC GTG 1299
Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr Ile Val
330 335 340 345

GAA TCA AAC GCT ACA TCT ACA TCC ACC ACT GGG ACA AGC CAT CTT GTA 1347
Glu Ser Asn Ala Thr Ser Thr Ser Thr Thr Gly Thr Ser His Leu Val
350 355 360

AAA TGT GCG GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGG GAG TGC 1395
Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
365 370 375

TTC ATG GTG AAA GAC CTT TCA AAC CCC TCG AGA TAC TTG TGC AAG TGC 1443
Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys
380 385 390

CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC 1491
Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser
395 400 405

TTC TAC AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT GAA 1530
Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu
410 415 420



TAGGAGCATG CTCAGTTGGT GCTGCTTTCT TGTTGCTGCA TCTCCCCTCA GATTCCACCT 1590
AGAGCTAGAT GTGTCTTACC AGATCTAATA TTGACTGCCT CTGCCTGTCG CATGAGAACA 1650
TTAACAAAAG CAATTGTATT ACTTCCTCTG TTCGCGACTA GTTGGCTCTG AGATACTAAT 1710
AGGTGTGTGA GGCTCCGGAT GTTTCTGGAA TTGATATTGA ATGATGTGAT ACAAATTGAT 1770
AGTCAATATC AAGCAGTGAA ATATGATAAT AAAGGCATTT CAAAGTCTCA CTTTTATTGA 1830
TAAAATAAAA ATCATTCTAC TGAACAGTCC ATCTTCTTTA TACAATGACC ACATCCTGAA 1890
AAGGGTGTTG CTAAGCTGTA ACCGATATGC ACTTGAAATG ATGGTAAGTT AATTTTGATT 1950
CAGAATGTGT TATTTGTCAC AAATAAACAT AATAAAAGGA AAAAAAAAAA AAA 2003

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 22:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 11 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Xaa Lys
1 5 10

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 23:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 9 is unknown. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:



Thr Glu Thr Ser Ser Ser Gly Leu Xaa Leu Lys 
1 5 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Arg Lys
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 7 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Ala Gly Tyr Phe Ala Glu Xaa Ala Arg
1 5

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 26:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Thr Thr Glu Met Ala Ser Glu Gln Gly Ala
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Ala Lys Glu Ala Leu Ala Ala Leu Lys
1 5



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Phe Val Leu Gln Ala Lys Lys
1 5

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 29:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



Glu Thr Gln Pro Asp Pro Gly Gln Ile Leu Lys Lys Val Pro Met Val
1 5 10 15

Ile Gly Ala Tyr Thr
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in positions 1, 3, 17 and 19 is unknown.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

Xaa Glu Xaa Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys Glu
1 5 10 15

Xaa Gly Xaa Gly Lys
20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 31: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu 
1 5 10

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 32:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 6 is unknown. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

Lys Leu Glu Phe Leu Xaa Ala Lys 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

Xaa Val His Gln Val Trp Ala Ala Lys
1 5



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 34:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine;

Xaa in position 11 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:

Xaa Tyr Ile Phe Phe Met Glu Pro Glu Ala Xaa Ser Ser Gly
1 5 10

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 35:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine, 

Xaa in position 13 is unknown. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

Xaa Leu Gly Ala Trp Gly Pro Pro Ala Phe Pro Val Xaa Tyr
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

Xaa Trp Phe Val Val Ile Glu Gly Lys
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:

Xaa Ala Ser Pro Val Ser Val Gly Ser Val Gln Glu Leu Val Gln Arg
1 5 10 15

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 38:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Xaa Val Cys Leu Leu Thr Val Ala Ala Leu Pro Pro Thr 
1 5 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 39: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 

(B) TYPE: amino acid 
(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 1 is Lysine or Arginine;

Xaa in position 6 is unknown.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

Xaa Asp Leu Leu Leu Xaa Val
1 5



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 40:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Cys Thr Cys Gly Cys Cys Lys Cys Cys Arg Thr Thr Cys Ala Cys Arg 
1 5 10 15

Cys Ala Gly Ala Ala Gly Gly Thr Cys Thr Thr Cys Thr Cys Cys Thr
20 25 30

Thr Cys Thr Cys Ala Gly Cys
35

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 41:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 



Cys Cys Thr Cys Gly Cys Thr Cys Cys Thr Thr Cys Thr Thr Cys Thr 
1 5 10 15 

Thr Gly Cys Cys Cys Thr Thr Cys 
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 42: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 60 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42:

AAGTGCCCAA ATGAGTTTAC TGGTGATCGC TGCCAAAACT ACGTAATGGC CAGCTTCTAC



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 43:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



AGTACGTCCA CTCCCTTTCT GTCTCTGCCT GAATAG

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 44:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

AAGGCGGAGG AGCTGTACCA GAAGAGAGTG CTGACCATAA CCGGCATCTG CATCGCCCTC 
CTTGTGGTCG GCATCATGTG TGTGGTGGCC TACTGCAAAA CCAAGAAACA GCGGAAAAAG 
CTGCATGACC GTCTTCGGCA GAGCCTTCGG TCTGAACGAA ACAATATGAT GAACATTGCC 
AATGGGCCTC ACCATCCTAA CCCACCCCCC GAGAATGTCC AGCTGGTGAA TCAATACGTA 
TCTAAAAACG TCATCTCCAG TGAGCATATT GTTGAGAGAG AAGCAGAGAC ATCCTTTTCC 
ACCAGTCACT ATACTTCCAC AGCCCATCAC TCCACTACTG TCACCCAGAC TCCTAGCCAC 
AGCTGGAGCA ACGGACACAC TGAAAGCATC CTTTCCGAAA GCCACTCTGT AATCGTGATG 
TCATCCGTAG AAAACAGTAG GCACAGCAGC CCAACTGGGG GCCCAAGAGG ACGTCTTAAT 
GGCACAGGAG GCCCTCGTGA ATGTAACAGC TTCCTCAGGC ATGCCAGAGA AACCCCTGAT 
TCCTACCGAG ACTCTCCTCA TAGTGAAAG 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 45:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:

Val His Gln Val Trp Ala Ala Lys
1 5



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 46:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 10 is unknown. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46:

Tyr Ile Phe Phe Met Glu Pro Glu Ala Xaa Ser Ser Gly
1 5 10

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 47:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 12 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

Leu Gly Ala Trp Gly Pro Pro Ala Phe Pro Val Xaa Tyr
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 48: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

Trp Phe Val Val Ile Glu Gly Lys
1 5



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 49:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15
(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

Ala Ser Pro Val Ser Val Gly Ser Val Gln Glu Leu Val Gln Arg
1 5 10 15

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 50:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 



Val Cys Leu Leu Thr Val Ala Ala Leu Pro Pro Thr 
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 



Lys Val His Gln Val Trp Ala Ala Lys
1 5



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 52:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13

(B) TYPE: amino acid
(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 12 is unknown.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Xaa Lys
1 5 10

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 53:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 5 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 



Asp Leu Leu Leu Xaa Val 
1 5 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 54: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54:

TTYAARGGNG AYGCNCAYAC 20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:

CATRTAYTCR TAYTCRTCNG C 21

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 56:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 



TGYTCNGANG CCATYTCNGT 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 57:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57:

TGYTCRCTNG CCATYTCNGT 20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 58:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:

CCDATNACCA TNGGNACYTT 20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 59:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59:

GCNGCCCANA CYTGRTGNAC 20

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 60:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 



GCYTCNGGYT CCATRAARAA 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 61:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:

CCYTCDATNA CNACRAACCA 20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 62:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

TCNGCRAART ANCCNGC 17



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 63:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

GCNGCNAGNG CYTCYTTNGC 20

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 64:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:



GCNGCYAANG CYTCYTTNGC 20 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 65:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



TTYTTNGCYT GNAGNACRAA 20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 66:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:

TTYTTNGCYT GYAANACRAA 20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 67:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 



TGNACNAGYT CYTGNAC 17 






(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 68:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



TGNACYAAYT CYTGNAC 17 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 69:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 



CATRTAYTCN CCNGARTCNG C 21



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 70:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

CATRTAYTCN CCRCTRTCNG C 21



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 71:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

NGARTCNGCY AANGANGCYT T 21

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 72:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:



NGARTCNGCN AGNGANGCYT T 21 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 73:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 



RCTRTCNGCY AANGANGCYT T 21



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 74:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

RCTRTCNGCN AGNGANGCYT T 21



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 75:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



NGARTCNGCY AARCTNGCYT T 21



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 76:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:



NGARTCNGCN AGRCTNGCYT T 21 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 77: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 



GTATGTGTCA GCCATGACCA CCCCGGCTCG TATGTCACCT GTAGATTTCC ACACGCCAAG 60

CTCCCCCAAA TCGCCCCCTT CGGAAATGTC TCCACCCGTG TCCAGCATGA CGGTGTCCAT 120

GCCTTCCATG GCGGTCAGCC CCTTCATGGA AGAAGAGAGA CCTCTACTTC TCGTGACACC 180

ACCAAGGCTG CGGGAGAAGA AGTTTGACCA TCACCCTCAG CAGTTCAGCT CCTTCCACCA 240

CAACCCCGCG CATGACAGTA ACAGCCTCCC TGCTAGCCCC TTGAGGATAG TGGAGGATGA 300

GGAGTATGAA ACGACCCAAG AGTACGAGCC AGCCCAAGAG CCTGTTAAGA AACTCGCCAA 360

TAGCCGGCGG GCCAAAAGAA CCAAGCCCAA TGGCCACATT GCTAACAGAT TGGAAGTGGA 420

CAGCAACACA AGCTCCCAGA GCAGTAACTC AGAGAGTGAA ACAGAAGATG AAAGAGTAGG 480

TGAAGATACG CCTTTCCTGG GCATACAGAA CCCCCTGGCA GCCAGTCTTG AGGCAACACC 540

TGCCTTCCGC CTGGCTGACA GCAGGACTAA CCCAGCAGGC CGCTTCTCGA CACAGGAAGA 600

AATCCAGGCC AGGCTGTCTA GTGTAATTGC TAACCAAGAC CCTATTGCTG TATAAAACCT 660

AAATAAACAC ATAGATTCAC CTGTAAAACT TTATTTTATA TAATAAAGTA TTCCACCTTA 720

AATTAAACAA 730
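
Each nucleotide line in this listing ends with a running count of bases, which makes it possible to spot characters gained or lost in transcription. The following is a minimal sketch of such a check. It assumes every whitespace-separated token except the last is sequence and the last token is the printed running count; the function name `check_running_counts` is hypothetical.

```python
def check_running_counts(lines, start=0):
    """Return True if every printed running count matches the bases seen so far.

    lines: sequence lines such as "AATTAAACAA 730".
    start: number of bases that precede the first line given.
    Assumption: the final token on each line is the running count and all
    other tokens are sequence groups.
    """
    total = start
    for line in lines:
        *groups, printed = line.split()
        total += sum(len(group) for group in groups)
        if total != int(printed):
            return False
    return True


if __name__ == "__main__":
    # Last two lines of SEQ ID NO: 77, which follow the first 660 bases.
    tail = [
        "AAATAAACAC ATAGATTCAC CTGTAAAACT TTATTTTATA TAATAAAGTA TTCCACCTTA 720",
        "AATTAAACAA 730",
    ]
    print(check_running_counts(tail, start=660))
```

A line that still carries stray leading characters fails this check, because the extra characters are counted as bases.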



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 78: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:



RCTRTCNGCY AARCTNGCYT T 21



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 79:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79:



RCTRCTNGCN AGRCTNGCYT T 21 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 80:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:



ACNACNGARA TGGCTCNNGA 20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 81:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:



ACNACNGARA TGGCAGYNGA 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 82:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:



CAYCARGTNT GGGCNGCNAA 20 






(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 83: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 



TTYGTNGTNA THGARGGNAA 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 84: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

AARGGNGAYG CNCAYACNGA 20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 85:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85:



GARGCNYTNG CNGCNYTNAA 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 86: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:



GTNGGNTCNG TNCARGARYT 20 




(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 87: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:



GTNGGNAGYG TNCARGARYT 20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 88: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88:



NACYTTYTTN ARDATYTGNC C 21



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 89:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 417

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in positions 14, 23, 90, 100, 126, and 135 is a stop codon.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:

TCTAA AAC TAC AGA GAC TGT ATT TTC ATG ATC ATC ATA GTT CTG TGA AAT ATA 53
Asn Tyr Arg Asp Cys Ile Phe Met Ile Ile Ile Val Leu Xaa Asn Ile
1 5 10 15

CTT AAA CCG CTT TGG TCC TGA TCT TGT AGG AAG TCA GAA CTT CGC ATT 101
Leu Lys Pro Leu Trp Ser Xaa Ser Cys Arg Lys Ser Glu Leu Arg Ile
20 25 30

AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT ATG TGC AAA GTG ATC 149
Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val Ile
35 40 45

AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC ATC ACC ATT GTG GAG 197
Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr Ile Val Glu
50 55 60

TCA AAC GGT AAG AGA TGC CTA CTG CGT GCT ATT TCT CAG TCT CTA AGA 245
Ser Asn Gly Lys Arg Cys Leu Leu Arg Ala Ile Ser Gln Ser Leu Arg
65 70 75 80

GGA GTG ATC AAG GTA TGT GGT CAC ACT TGA ATC ACG CAG GTG TGT GAA 293
Gly Val Ile Lys Val Cys Gly His Thr Xaa Ile Thr Gln Val Cys Glu
85 90 95

ATC TCA TTG TGA ACA AAT AAA AAT CAT GAA AGG AAA ACT CTA TGT TTG 341
Ile Ser Leu Xaa Thr Asn Lys Asn His Glu Arg Lys Thr Leu Cys Leu
100 105 110

AAA TAT CTT ATG GGT CCT CCT GTA AAG CTC TTC ACT CCA TAA GGT GAA 389
Lys Tyr Leu Met Gly Pro Pro Val Lys Leu Phe Thr Pro Xaa Gly Glu
115 120 125

ATA GAC CTG AAA TAT ATA TAG ATT ATT T 417
Ile Asp Leu Lys Tyr Ile Xaa Ile Ile
130 135
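
The amino acid line printed under each row of codons in SEQ ID NO: 89 can be re-derived from the nucleotides. The following is a minimal sketch using the standard genetic code, with stop codons reported as Xaa as the FEATURE note of this entry describes. The function name `translate` and its `offset` parameter are hypothetical and only serve this illustration.

```python
BASES = "TCAG"
# Standard genetic code, one letter per codon, codons ordered TTT, TTC, TTA, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Three-letter names; a stop codon is written Xaa, as in SEQ ID NO: 89.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe", "G": "Gly",
    "H": "His", "I": "Ile", "K": "Lys", "L": "Leu", "M": "Met", "N": "Asn",
    "P": "Pro", "Q": "Gln", "R": "Arg", "S": "Ser", "T": "Thr", "V": "Val",
    "W": "Trp", "Y": "Tyr", "*": "Xaa",
}


def translate(dna, offset=0):
    """Translate DNA into three-letter residues, starting `offset` bases in.

    Spaces are ignored and a trailing partial codon is dropped.
    """
    seq = dna.replace(" ", "").upper()[offset:]
    usable = len(seq) - len(seq) % 3
    return [THREE_LETTER[CODON_TABLE[seq[i:i + 3]]] for i in range(0, usable, 3)]


if __name__ == "__main__":
    first_row = ("TCTAA AAC TAC AGA GAC TGT ATT TTC ATG "
                 "ATC ATC ATA GTT CTG TGA AAT ATA")
    # The first five bases (TCTAA) precede the reading frame shown in the listing.
    print(" ".join(translate(first_row, offset=5)))
```

Comparing this output row by row with the printed residues is a quick way to catch a residue that was garbled in transcription.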



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 90:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 19, 25, and 31 is Inosine. Y can be cytidine or thymidine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:

CCGAATTCTG CAGGARACNC ARCCNGAYCC NGG 33



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 91:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 14, 20, 23, 29, and 35 is 

Inosine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
AAGGATCCTG CAGNGTRTAN GCNCCDATNA CCATNGG 37 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 92: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 16, 21, and 24 is Inosine. Y can be cytidine or thymidine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:



CCGAATTCTG CAGGCNGAYT CNGGNGARTA YATG 34



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 93:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 16 and 25 is Inosine. Y can be cytidine or thymidine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:

CCGAATTCTG CAGGCNGAYA GYGGNGARTA YAT 33



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 94: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 14, 15, 16, 26, and 29 is Inosine. Y can be cytidine or thymidine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:



AAGGATCCTG CAGNNNCATR TAYTCNCCNG ARTC 34 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 95: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 14, 15, 16, and 26 is Inosine. Y can be cytidine or thymidine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:

AAGGATCCTG CAGNNNCATR TAYTCNCCRC TRTC 34



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 96:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 33

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 21, 28, and 31 is Inosine. Y can be cytidine or thymidine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:



CCGAATTCTG CAGCAYCARG TNTGGGCNGC NAA 33 






(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 97: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N at position 31 is Inosine. Y can be cytidine or thymidine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:

CCGAATTCTG CAGATHTTYT TYATGGARCC NGARG 35 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 98: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 18, 21, 24, 27, and 33 is Inosine. Y can be cytidine or thymidine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

CCGAATTCTG CAGGGGGNCC NCCNGCNTTY CCNGT 35



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 99:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 21 and 24 is Inosine. Y can be cytidine or thymidine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99:

CCGAATTCTG CAGTGGTTYG TNGTNATHGA RGG 33



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 100:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 17, 20, and 26 is Inosine. Y can be cytidine or thymidine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:



AAGGATCCTG CAGYTTNGCU NGCCCANACY TGRTG 35 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 101: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at position 19 is Inosine. Y can be 

cytidine or thymidine. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101:



AAGGATCCTG CAGGCYTCNG GYTCCATRAA RAA 33



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 102: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 16, 22, 25, 28, and 31 is 

Inosine. 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102:



AAGGATCCTG CAGACNGGRA ANGCNGGNGG NCC 33



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 103:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N at positions 17, 26, and 29 is Inosine. Y can be cytidine or thymidine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103:



AAGGATCCTG CAGYTTNCCY TCDATNACNA CRAAC 35



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 104: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(D) OTHER INFORMATION: N at position 18 is Inosine. Y can be cytidine or thymidine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104:

CATRTAYTCR TAYTCTCNGC AAGGATCCTG CAG 33



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 105:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 19, 25, and 31 is Inosine. Y can be cytidine or thymidine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105:



CCGAATTCTG CAGAARGGNG AYGCNCAYAC NGA 33



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 106:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 3 and 18 is Inosine. Y can be cytidine or thymidine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

GCNGCYAANG CYTCYTTNGC AAGGATCCTG CAG 33 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 107: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 3, 6, 9, and 18 is Inosine. Y can be cytidine or thymidine.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107:

GCNGCNAGNG CYTCYTTNGC AAGGATCCTG CAG 33



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 108:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N at positions 3, 12, and 15 is Inosine. Y can be cytidine or thymidine.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

TCNGCRAART ANCCNGCAAG GATCCTGCAG 30



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 109:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:



CATCGATCTG CAGGCTGATT CTGGAGAATA TATGTGCA 38



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 110:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:



AAGGATCCTG CAGCCACATC TCGAGTCGAC ATCGATT 37



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 111:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111:



CCGAATTCTG CAGTGATCAG CAAACTAGGA AATGACA 37



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 112:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:

CATCGATCTG CAGCCTAGTT TGCTGATCAC TTTGCAC 37



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 113:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:



AAGGATCCTG CAGTATATTC TCCAGAATCA GCCAGTG 37



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 114:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 34

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:



AAGGATCCTG CAGGCACGCA GTAGGCATCT CTTA 34



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 115:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:



CCGAATTCTG CAGCAGAACT TCGCATTAGC AAAGC 35



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 116:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 33

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:



CATCCCGGGA TGAAGAGTCA GGAGTCTGTG GCA 33



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 117:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:



ATACCCGGGC TGCAGACAAT GAGATTTCAC ACACCTGCG 39



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 118:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 36

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118:



AAGGATCCTG CAGTTTGGAA CCTGCCACAG ACTCCT 36



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 119:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 39

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:



ATACCCGGGC TGCAGATGAG ATTTCACACA CCTGCGTGA 39



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 120:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:



His Gln Val Trp Ala Ala Lys Ala Ala Gly Leu Lys
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 121:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121:

Gly Gly Leu Lys Lys Asp Ser Leu Leu Thr Val Arg Leu Gly Ala Asn
1 5 10 15



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 122: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 12 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:

Leu Gly Ala Trp Gly Pro Pro Ala Phe Pro Val Xaa Tyr

1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 123: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:

Leu Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser
1 5 10 15

Cys Gly Arg Leu Lys Glu Asp
20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 124:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 10 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:

Tyr Ile Phe Phe Met Glu Pro Glu Ala Xaa Ser Ser Gly
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 125: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

Lys Glu Asp Ser Arg Tyr Ile Phe Phe Met Glu Pro Glu Ala Asn Ser
1 5 10 15

Ser Gly Gly Pro Gly Arg Leu
20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 126:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 14

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:



Val Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 127:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:

Glu Tyr Lys Cys Leu Lys Phe Lys Trp Phe Lys Lys Ala Thr Val Met
1 5 10 15



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 128: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 



Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu Lys Phe Lys Trp Phe Lys
1 5 10 15

Asn Gly Ser Glu Leu Ser Arg Lys Asn Lys
20 25



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 129: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 12 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Xaa Lys
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 130:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:



Glu Leu Arg Ile Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met
1 5 10 15

Cys Lys Val Ile Ser Lys Leu
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 131: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:



Ala Ser Leu Ala Asp Glu Tyr Glu Tyr Met Arg Lys
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 132:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:



Leu Arg Ile Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys
1 5 10 15

Lys Val Ile Ser Lys Leu
20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 133:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

CCTGCAG CAT CAA GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG 55
His Gln Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp Ser Leu
1 5 10 15

CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC TGC 103
Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser Cys
20 25 30

GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 151
Gly Arg Leu Lys Glu Asp Ser Arg Tyr Ile Phe Phe Met Glu Pro Glu
35 40 45

GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 199
Ala Asn Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro Pro
50 55 60

TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT GTG 247
Ser Arg Asp Gly Pro Glu Pro Gln Glu Gly Gly Gln Pro Gly Ala Val
65 70 75 80

CAA CGG TGC GCC TTG CCT CCC CGC TTG AAA GAG ATG AAG AGT CAG GAG 295
Gln Arg Cys Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu
85 90 95

TCT GTG GCA GGT TCC AAA CTA GTG CTT CGG TGC GAG ACC AGT TCT GAA 343
Ser Val Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu
100 105 110

TAC TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG AGT GAA TTA AGC 391
Tyr Ser Ser Leu Lys Phe Lys Trp Phe Lys Asn Gly Ser Glu Leu Ser
115 120 125

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 439
Arg Lys Asn Lys Pro Glu Asn Ile Lys Ile Gln Lys Arg Pro Gly Lys
130 135 140

TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT 487
Ser Glu Leu Arg Ile Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr
145 150 155 160

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 535
Met Cys Lys Val Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn
165 170 175

ATC ACC ATT GTG GAG TCA AAC GGT AAG AGA TGC CTA CTG CGT GCT ATT 583
Ile Thr Ile Val Glu Ser Asn Gly Lys Arg Cys Leu Leu Arg Ala Ile
180 185 190

TCT CAG TCT CTA AGA GGA GTG ATC AAG GTA TGT GGT CAC ACT 625
Ser Gln Ser Leu Arg Gly Val Ile Lys Val Cys Gly His Thr
195 200 205

TGAATCACGC AGGTGTGTGA AATCTCATTG TGAACAAATA AAAATCATGA AAGGAAAAAA 685
AAAAAAAAAA AATCGATGTC GACTCGAGAT GTGGCTGCAG GTCGACTCTA GAGGATCCC 744



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 134: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1193 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134: 



CCTGCAG CAT CAA GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG 55
His Gln Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp Ser Leu
1 5 10 15

CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC TGC 103
Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser Cys
20 25 30

GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 151
Gly Arg Leu Lys Glu Asp Ser Arg Tyr Ile Phe Phe Met Glu Pro Glu
35 40 45

GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 199
Ala Asn Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro Pro
50 55 60

TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT GTG 247
Ser Arg Asp Gly Pro Glu Pro Gln Glu Gly Gly Gln Pro Gly Ala Val
65 70 75 80

CAA CGG TGC GCC TTG CCT CCC CGC TTG AAA GAG ATG AAG AGT CAG GAG 295
Gln Arg Cys Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu
85 90 95

TCT GTG GCA GGT TCC AAA CTA GTG CTT CGG TGC GAG ACC AGT TCT GAA 343
Ser Val Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu
100 105 110

TAC TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG AGT GAA TTA AGC 391
Tyr Ser Ser Leu Lys Phe Lys Trp Phe Lys Asn Gly Ser Glu Leu Ser
115 120 125

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 439
Arg Lys Asn Lys Pro Glu Asn Ile Lys Ile Gln Lys Arg Pro Gly Lys
130 135 140

TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT 487
Ser Glu Leu Arg Ile Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr
145 150 155 160

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 535
Met Cys Lys Val Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn
165 170 175

ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG ACA 583
Ile Thr Ile Val Glu Ser Asn Ala Thr Ser Thr Ser Thr Ala Gly Thr
180 185 190

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 631
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn
195 200 205

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 679
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr
210 215 220

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 727
Leu Cys Lys Cys Gln Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn
225 230 235 240

GTG CCC ATG AAA GTC CAA ACC CAA GAA AGT GCC CAA ATG AGT TTA CTG 775
Val Pro Met Lys Val Gln Thr Gln Glu Ser Ala Gln Met Ser Leu Leu
245 250 255

GTG ATC GCT GCC AAA ACT ACG TAATGGCCAG CTTCTACAGT ACGTCCACTC 826
Val Ile Ala Ala Lys Thr Thr
260

CCTTTCTGTC TCTGCCTGAA TAGCGCATCT CAGTCGGTGC CGCTTTCTTG TTGCCGCATC 886
TCCCCTCAGA TTCCTCCTAG AGCTAGATGC GTTTTACCAG GTCTAACATT GACTGCCTCT 946
GCCTGTCGCA TGAGAACATT AACACAAGCG ATTGTATGAC TTCCTCTGTC CGTGACTAGT 1006
GGGCTCTGAG CTACTCGTAG GTGCGTAAGG CTCCAGTGTT TCTGAAATTG ATCTTGAATT 1066
ACTGTGATAC GACATGATAG TCCCTCTCAC CCAGTGCAAT GACAATAAAG GCCTTGAAAA 1126
GTCAAAAAAA AAAAAAAAAA AAAAAATCGA TGTCGACTCG AGATGTGGCT GCAGGTCGAC 1186
TCTAGAG 1193






(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 135: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1108 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

CCTGCAG CAT CAA GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG 55
His Gln Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp Ser Leu
1 5 10 15

CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC TGC 103
Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser Cys
20 25 30

GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 151
Gly Arg Leu Lys Glu Asp Ser Arg Tyr Ile Phe Phe Met Glu Pro Glu
35 40 45

GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 199
Ala Asn Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro Pro
50 55 60

TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT GTG 247
Ser Arg Asp Gly Pro Glu Pro Gln Glu Gly Gly Gln Pro Gly Ala Val
65 70 75 80

CAA CGG TGC GCC TTG CCT CCC CGC TTG AAA GAG ATG AAG AGT CAG GAG 295
Gln Arg Cys Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu
85 90 95

TCT GTG GCA GGT TCC AAA CTA GTG CTT CGG TGC GAG ACC AGT TCT GAA 343
Ser Val Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu
100 105 110

TAC TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG AGT GAA TTA AGC 391
Tyr Ser Ser Leu Lys Phe Lys Trp Phe Lys Asn Gly Ser Glu Leu Ser
115 120 125

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 439
Arg Lys Asn Lys Pro Glu Asn Ile Lys Ile Gln Lys Arg Pro Gly Lys
130 135 140

TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT 487
Ser Glu Leu Arg Ile Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr
145 150 155 160

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 535
Met Cys Lys Val Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn
165 170 175

ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG ACA 583
Ile Thr Ile Val Glu Ser Asn Ala Thr Ser Thr Ser Thr Ala Gly Thr
180 185 190

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 631
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn
195 200 205

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 679
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr
210 215 220

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC 727
Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr
225 230 235 240

GTA ATG GCC AGC TTC TAC AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT 775
Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro
245 250 255

GAA TAGCGCATCT CAGTCGGTGC CGCTTTCTTG TTGCCGCATC TCCCCTCAGA TTCCGCCTAG 838
Glu

AGCTAGATGC GTTTTACCAG GTCTAACATT GACTGCCTCT GCCTGTCGCA TGAGAACATT 898
AACACAAGCG ATTGTATGAC TTCCTCTGTC CGTGACTAGT GGGCTCTGAG CTACTCGTAG 958
GTGCGTAAGG CTCCAGTGTT TCTGAAATTG ATCTTGAATT ACTGTGATAC GACATGATAG 1018
TCCCTCTCAC CCAGTGCAAT GACAATAAAG GCCTTGAAAA GTCAAAAAAA AAAAAAAAAA 1078
AAAAATCGAT GTCGACTCGA GATGTGGCTG 1108



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 136: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 559 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N in position 214 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:



AGTTTCCCCC CCCAACTTGT CGGAACTCTG GGCTCGCGCG CAGGGCAGGA GCGGAGCGGC 60
GGCGGCTGCC CAGGCGATGC GAGCGCGGGC CGGACGGTAA TCGCCTCTCC CTCCTCGGGC 120
TGCGAGCGCG CCGGACCGAG GCAGCGACAG GAGCGGACCG CGGCGGGAAC CGAGGACTCC 180
CCAGCGGCGC GCCAGCAGGA GCCACCCCGC GAGNCGTGCG ACCGGGACGG AGCGCCCGCC 240
AGTCCCAGGT GGCCCGGACC GCACGTTGCG TCCCCGCGCT CCCCGCCGGC GACAGGAGAC 300
GCTCCCCCCC ACGCCGCGCG CGCCTCGGCC CGGTCGCTGG CCCGCCTCCA CTCCGGGGAC 360
AAACTTTTCC CGAAGCCGAT CCCAGCCCTC GGACCCAAAC TTGTCGCGCG TCGCCTTCGC 420
CGGGAGCCGT CCGCGCAGAG CGTGCACTTC TCGGGCGAG ATG TCG GAG CGC AGA 474
Met Ser Glu Arg Arg
1 5

GAA GGC AAA GGC AAG GGG AAG GGC GGC AAG AAG GAC CGA GGC TCC GGG 522
Glu Gly Lys Gly Lys Gly Lys Gly Gly Lys Lys Asp Arg Gly Ser Gly
10 15 20

AAG AAG CCC GTG CCC GCG GCT GGC GGC CCG AGC CCA G 559
Lys Lys Pro Val Pro Ala Ala Gly Gly Pro Ser Pro Ala
25 30



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 137: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N in position 8 could be either A or G. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:

CC CAT CAN GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG 47
His Gln Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp Ser
1 5 10 15

CTG CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC 95
Leu Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser
20 25 30

TGC GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC 143
Cys Gly Arg Leu Lys Glu Asp Ser Arg Tyr Ile Phe Phe Met Glu Pro
35 40 45

GAG GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC 191
Glu Ala Asn Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro
50 55 60

CCC TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT 239
Pro Ser Arg Asp Gly Pro Glu Pro Gln Glu Gly Gly Gln Pro Gly Ala
65 70 75

GTG CAA CGG TGC G 252
Val Gln Arg Cys
80



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 138: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 



CCT TGC CTC CCC GCT TGA AAG AGA TGA AGA GTC AGG AGT CTG TGG CAG 
Leu Pro Pro Arg Leu Lys Glu His Lys Ser Gln Glu Ser Val Ala Gly
1 5 10 15

GTT CCA AAC TAG TGC TTC GGT GCG AGA CCA GTT CTG AAT ACT CCT CTC
Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu
20 25 30

TCA AGT TCA AGT GGT TCA AGA ATG GGA GTG AAT TAA GCC GAA AGA ACA
Lys Phe Lys Trp Phe Lys Asn Gly Ser Glu Leu Ser Arg Lys Asn Lys
35 40 45

AAC CAC AAA ACA TCA AGA TAC AGA AAA GGC CGG G
Pro Gly Asn Ile Lys Ile Gln Lys Arg Pro Gly
50 55

80 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 139:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 122

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:



G AAG TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA
Lys Ser Glu Leu Arg Ile Ser Lys Ala Ser Leu Ala Asp Ser Gly
1 5 10 15

GAA TAT ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT
Glu Tyr Met Cys Lys Val Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser
20 25 30

GCC AAC ATC ACC ATT GTG GAG TCA AAC G
Ala Asn Ile Thr Ile Val Glu Ser Asn Ala
35

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 140:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:

TCTAAAACTA CAGAGACTGT ATTTTCATGA TCATCATAGT TCTGTGAAAT ATACTTAAAC
CGCTTTGGTC CTGATCTTGT AGG AAG TCA GAA CTT CGC ATT AGC AAA GCG

Lys Ser Glu Leu Arg Ile Ser Lys Ala
1 5

TCA CTG GCT GAT TCT GGA GAA TAT ATG TGC AAA GTG ATC AGC AAA CTA
Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val Ile Ser Lys Leu
10 15 20 25

GGA AAT GAC AGT GCC TCT GCC AAC ATC ACC ATT GTG GAG TCA AAC GGT
Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr Ile Val Glu Ser Asn Gly
30 35 40

AAG AGA TGC CTA CTG CGT GCT ATT TCT CAG TCT CTA AGA GGA GTG ATC
Lys Arg Cys Leu Leu Arg Ala Ile Ser Gln Ser Leu Arg Gly Val Ile
45 50 55

AAG GTA TGT GGT CAC ACT TGAATCACGC AGGTGTGTGA AATCTCATTG
Lys Val Cys Gly His Thr
60

TGAACAAATA AAAATCATGA AAGGAAAACT CTATGTTTGA AATATCTTAT GGGTCCTCCT
GTAAAGCTCT TCACTCCATA AGGTGAAATA GACCTGAAAT ATATATAGAT TATTT



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 141: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

AG ATC ACC ACT GGC ATG CCA GCC TCA ACT GAG ACA GCG TAT GTG TCT
Glu Ile Thr Thr Gly Met Pro Ala Ser Thr Glu Thr Ala Tyr Val Ser
1 5 10 15

TCA GAG TCT CCC ATT AGA ATA TCA GTA TCA ACA GAA GGA ACA AAT ACT
Ser Glu Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly Thr Asn Thr
20 25 30

TCT TCA T
Ser Ser Ser
35

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 142:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 



AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT GTG CCC 
Lys Cys Gln Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro
1 5 10 15

ATG AAA GTC CAA ACC CAA GAA
Met Lys Val Gln Thr Gln Glu
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 143: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 60

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:

AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC GTA ATG
Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met
1 5 10 15

GCC AGC TTC TAC
Ala Ser Phe Tyr
20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 144: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:

AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT GAA TAG
Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 145: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145:

AAG CAT CTT GGG ATT GAA TTT ATG GAG
Lys His Leu Gly Ile Glu Phe Met Glu
1 5



27 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 146: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146: 



AAA GCG GAG GAG CTC TAC CAG AAG AGA GTG CTC ACC ATT ACC GGC ATT 48
Lys Ala Glu Glu Leu Tyr Gln Lys Arg Val Leu Thr Ile Thr Gly Ile
1 5 10 15

TGC ATC GCG CTG CTC GTG GTT GGC ATC ATG TGT GTG GTG GTC TAC TGC 96
Cys Ile Ala Leu Leu Val Val Gly Ile Met Cys Val Val Val Tyr Cys
20 25 30

AAA ACC AAG AAA CAA CGG AAA AAG CTT CAT GAC CGG CTT CGG CAG AGC 144
Lys Thr Lys Lys Gln Arg Lys Lys Leu His Asp Arg Leu Arg Gln Ser
35 40 45

CTT CGG TCT GAA AGA AAC ACC ATG ATG AAC GTA GCC AAC GGG CCC CAC 192
Leu Arg Ser Glu Arg Asn Thr Met Met Asn Val Ala Asn Gly Pro His
50 55 60

CAC CCC AAT CCG CCC CCC GAG AAC GTG CAG CTG GTG AAT CAA TAC GTA 240
His Pro Asn Pro Pro Pro Glu Asn Val Gln Leu Val Asn Gln Tyr Val
65 70 75 80

TCT AAA AAT GTC ATC TCT AGC GAG CAT ATT GTT GAG AGA GAG GCG GAG 288
Ser Lys Asn Val Ile Ser Ser Glu His Ile Val Glu Arg Glu Ala Glu
85 90 95

AGC TCT TTT TCC ACC AGT CAC TAC ACT TCG ACA GCT CAT CAT TCC ACT 336
Ser Ser Phe Ser Thr Ser His Tyr Thr Ser Thr Ala His His Ser Thr
100 105 110

ACT GTC ACT CAG ACT CCC AGT CAC AGC TGG AGC AAT GGA CAC ACT GAA
Thr Val Thr Gln Thr Pro Ser His Ser Trp Ser Asn Gly His Thr Glu
115 120 125

AGC ATC ATT TCG GAA AGC CAC TCT GTC ATC GTG ATG TCA TCC GTA GAA
Ser Ile Ile Ser Glu Ser His Ser Val Ile Val Met Ser Ser Val Glu
130 135 140

AAC AGT AGG CAC AGC AGC CCG ACT GGG GGC CCG AGA GGA CGT CTC AAT
Asn Ser Arg His Ser Ser Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn
145 150 155 160

GGC TTG GGA GGC CCT CGT GAA TGT AAC AGC TTC CTC AGG CAT GCC AGA
Gly Leu Gly Gly Pro Arg Glu Cys Asn Ser Phe Leu Arg His Ala Arg
165 170 175

GAA ACC CCT GAC TCC TAC CGA GAC TCT CCT CAT AGT G AAAG
Glu Thr Pro Asp Ser Tyr Arg Asp Ser Pro His Ser
180 185



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 147:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 730

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147:

G TAT GTA TCA GCA ATG ACC ACC CCG GCT CGT ATG TCA CCT GTA GAT
Tyr Val Ser Ala Met Thr Thr Pro Ala Arg Met Ser Pro Val Asp
1 5 10 15

TTC CAC ACG CCA AGC TCC CCC AAG TCA CCC CCT TCG GAA ATG TCC CCG
Phe His Thr Pro Ser Ser Pro Lys Ser Pro Pro Ser Glu Met Ser Pro
20 25 30

CCC GTG TCC AGC ACG ACG GTC TCC ATG CCC TCC ATG GCG GTC AGT CCC
Pro Val Ser Ser Thr Thr Val Ser Met Pro Ser Met Ala Val Ser Pro
35 40 45

TTC GTG GAA GAG GAG AGA CCC CTG CTC CTT GTG ACG CCA CCA CGG CTG
Phe Val Glu Glu Glu Arg Pro Leu Leu Leu Val Thr Pro Pro Arg Leu
50 55 60

CGG GAG AAG TAT GAC CAC CAC GCC CAG CAA TTC AAC TCG TTC CAC TGC
Arg Glu Lys Tyr Asp His His Ala Gln Gln Phe Asn Ser Phe His Cys
65 70 75






AAC CCC GCG CAT GAG AGC AAC AGC CTG CCC CCC AGC CCC TTG AGG ATA 286
Asn Pro Ala His Glu Ser Asn Ser Leu Pro Pro Ser Pro Leu Arg Ile
80 85 90 95

GTG GAG GAT GAG GAA TAT GAA ACG ACC CAG GAG TAC GAA CCA GCT CAA 334
Val Glu Asp Glu Glu Tyr Glu Thr Thr Gln Glu Tyr Glu Pro Ala Gln
100 105 110

GAG CCG GTT AAG AAA CTC ACC AAC AGC AGC CGG CGG GCC AAA AGA ACC 382
Glu Pro Val Lys Lys Leu Thr Asn Ser Ser Arg Arg Ala Lys Arg Thr
115 120 125

AAG CCC AAT GGT CAC ATT GCC CAC AGG TTG GAA ATG GAC AAC AAC ACA 430
Lys Pro Asn Gly His Ile Ala His Arg Leu Glu Met Asp Asn Asn Thr
130 135 140

GGC GCT GAC AGC AGT AAC TCA GAG AGC GAA ACA GAG GAT GAA AGA GTA 478
Gly Ala Asp Ser Ser Asn Ser Glu Ser Glu Thr Glu Asp Glu Arg Val
145 150 155

GGA GAA GAT ACG CCT TTC CTG GCC ATA CAG AAC CCC CTG GCA GCC AGT 526
Gly Glu Asp Thr Pro Phe Leu Ala Ile Gln Asn Pro Leu Ala Ala Ser
160 165 170 175

CTC GAG GCG GCC CCT GCC TTC CGC CTG GTC GAC AGC AGG ACT AAC CCA 574
Leu Glu Ala Ala Pro Ala Phe Arg Leu Val Asp Ser Arg Thr Asn Pro
180 185 190

ACA GGC GGC TTC TCT CCG CAG GAA GAA TTG CAG GCC AGG CTC TCC GGT 622
Thr Gly Gly Phe Ser Pro Gln Glu Glu Leu Gln Ala Arg Leu Ser Gly
195 200 205

GTA ATC GCT AAC CAA GAC CCT ATC GCT GTC TAAAACCGAA ATACACCCAT 672
Val Ile Ala Asn Gln Asp Pro Ile Ala Val
210 215

AGATTCACCT GTAAAACTTT ATTTTATATA ATAAAGTATT CCACCTTAAA TTAAACAA 730



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 148: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:



AGTTTCCCCC CCCAACTTGT CGGAACTCTG GGCTCGCGCG CAGGGCAGGA GCGGAGCGGC 60 

GGCGGCTGCC CAGGCGATGC GAGCGCGGGC CGGACGGTAA TCGCCTCTCC CTCCTCGGGC 120 

TGCGAGCGCG CCGGACCGAG GCAGCGACAG GAGCGGACCG CGGCGGGAAC CGAGGACTCC 180 




CCAGCGGCGC GCCAGCAGGA GCCACCCCGC GAGCGTGCGA CCGGGACGGA GCGCCCGCCA 240 
GTCCCAGGTG GCCCGGACCG CACGTTGCGT CCCCGCGCTC CCCGCCGGCG ACAGGAGACG 300 
CTCCCCCCCA CGCCGCGCGC GCCTCGGCCC GGTCGCTGGC CCGCCTCCAC TCCGGGGACA 360 
AACTTTTCCC GAAGCCGATC CCAGCCCTCG GACCCAAACT TGTCGCGCGT CGCCTTCGCC 420 
GGGAGCCGTC CGCGCAGAGC GTGCACTTCT CGGGCGAG ATG TCG GAG CGC AGA 473 

Met Ser Glu Arg Arg 
1 5 

GAA GGC AAA GGC AAG GGG AAG GGC GGC AAG AAG GAC CGA GGC TCC GGG 521 
Glu Gly Lys Gly Lys Gly Lys Gly Gly Lys Lys Asp Arg Gly Ser Gly 

10 15 20

AAG AAG CCC GTG CCC GCG GCT GGC GGC CCG AGC CCA GCC TTG CCT CCC 569
Lys Lys Pro Val Pro Ala Ala Gly Gly Pro Ser Pro Ala Leu Pro Pro

25 30 35

CGC TTG AAA GAG ATG AAG ATG CAG GAG TCT GTG GCA GGT TCC AAA CTA 617
Arg Leu Lys Glu Met Lys Ser Gln Glu Ser Val Ala Gly Ser Lys Leu
40 45 50

GTG CTT CGG TGC GAG ACC AGT TCT GAA TAC TCC TCT CTC AAG TTC AAG 665

Val Leu Arg Cys Glu Thr Ser Ser Glu Tyr Ser Ser Leu Lys Phe Lys

55 60 65

TGG TTC AAG AAT GGG AGT GAA TTA AGC CGA AAG AAC AAA CCA CAA AAC 713

Trp Phe Lys Asn Gly Ser Gly Leu Ser Arg Lys Asn Lys Pro Gln Asn

70 75 80 85

ATC AAG ATA CAG AAA AGG CCG GGG AAG TCA GAA CTT CGC ATT AGC AAA 761
Ile Lys Ile Gln Lys Arg Pro Gly Lys Ser Glu Leu Arg Ile Ser Lys

GCG TCA CTG GCT GAT TCT GGA GAA TAT ATG TGC AAA GTG ATC AGC AAA 809
Ala Ser Leu Ala Asp Ser Gly Glu Tyr Met Cys Lys Val Ile Ser Lys
105 110 115

CTA GGA AAT GAC AGT GCC TCT GCC AAC ATC ACC ATT GTG GAG TCA AAC 857
Leu Gly Asn Asp Ser Ala Ser Ala Asn Ile Thr Ile Val Glu Ser Asn
120 125 130

GAG ATC ACC ACT GGC ATG CCA GCC TCA ACT GAG ACA GCG TAT GTG TCT 905
Glu Ile Thr Thr Gly Met Pro Ala Ser Thr Glu Thr Ala Tyr Val Ser
135 140 145

TCA GAG TCT CCC ATT AGA ATA TCA GTA TCA ACA GAA GGA ACA AAT ACT 953
Ser Glu Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly Thr Asn Thr
150 155 160 165

TCT TCA TCC ACA TCC ACA TCT ACA GCT GGG ACA AGC CAT CTT GTC AAG 1001 
Ser Ser Ser Thr Ser Thr Ser Thr Ala Gly Thr Ser His Leu Val Lys 

170 175 180 






TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGC GAG TGC TTC 1049
Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe 
185 190 195 

ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC TTG TGC AAG TGC CCA 1097 
Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro 
200 205 210 

AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC 1145 
Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser Phe
215 220 225

TAC AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT GAA TAGGCGCATG 1193
Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro Glu
230 235 240 

CTCAGTCGGT GCCGCTTTCT TGTTGCCGCA TCTCCCCTCA GATTCAACCT AGAGCTAGAT 1253 

GCGTTTTACC AGGTCTAACA TTGACTGCCT CTGCCTGTCG CATGAGAACA TTAACACAAG 1313 

CGATTGTATG ACTTCCTCTG TCCGTGACTA GTGGGCTCTG AGCTACTCGT AGGTGCGTAA 1373 

GGCTCCAGTG TTTCTGAAAT TGATCTTGAA TTACTGTGAT ACGACATGAT AGTCCCTCTC 1433 

ACCCAGTGCA ATGACAATAA AGGCCTTGAA AAGTCTCACT TTTATTGAGA AAATAAAAAT 1493

CGTTCCACGG GACAGTCCCT CTTCTTTATA AAATGACCCT ATCCTTGAAA AGGAGGTGTG 1553

TTAAGTTGTA ACCAGTACAC ACTTGAAATG ATGGTAAGTT CGCTTCGGTT CAGAATGTGT 1613

TCTTTCTGAC AAATAAACAG AATAAAAAAA AAAAAAAAAA A 1652



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 149:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1140

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:



CAT CAN GTG TGG GCG GCG AAA GCC GGG GGC TTG AAG AAG GAC TCG CTG 48
His Gln Val Trp Ala Ala Lys Ala Gly Gly Leu Lys Lys Asp Ser Leu
1 5 10 15

CTC ACC GTG CGC CTG GGC GCC TGG GGC CAC CCC GCC TTC CCC TCC TGC 96
Leu Thr Val Arg Leu Gly Ala Trp Gly His Pro Ala Phe Pro Ser Cys
20 25 30

GGG CGC CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC ATG GAG CCC GAG 144
Gly Arg Leu Lys Glu Asp Ser Arg Tyr Ile Phe Phe Met Glu Pro Glu
35 40 45

GCC AAC AGC AGC GGC GGG CCC GGC CGC CTT CCG AGC CTC CTT CCC CCC 192
Ala Asn Ser Ser Gly Gly Pro Gly Arg Leu Pro Ser Leu Leu Pro Pro
50 55 60

TCT CGA GAC GGG CCG GAA CCT CAA GAA GGA GGT CAG CCG GGT GCT GTG 240
Ser Arg Asp Gly Pro Glu Pro Gln Glu Gly Gly Gln Pro Gly Ala Val
65 70 75 80

CAA CGG TGC GCC TTG CCT CCC CGC TTG AAA GAG ATG AAG AGT CAG GAG 288
Gln Arg Cys Ala Leu Pro Pro Arg Leu Lys Glu Met Lys Ser Gln Glu
85 90 95

TCT GTG GCA GGT TCC AAA CTA GTG CTT CGG TGC GAG ACC AGT TCT GAA 336
Ser Val Ala Gly Ser Lys Leu Val Leu Arg Cys Glu Thr Ser Ser Glu
100 105 110

TAC TCC TCT CTC AAG TTC AAG TGG TTC AAG AAT GGG AGT GAA TTA AGC 384
Tyr Ser Ser Leu Lys Phe Lys Trp Phe Lys Asn Gly Ser Glu Leu Ser
115 120 125

CGA AAG AAC AAA CCA GAA AAC ATC AAG ATA CAG AAA AGG CCG GGG AAG 432
Arg Lys Asn Lys Pro Glu Asn Ile Lys Ile Gln Lys Arg Pro Gly Lys
130 135 140

TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA TAT 480
Ser Glu Leu Arg Ile Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu Tyr
145 150 155 160

ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC AAC 528
Met Cys Lys Val Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala Asn
165 170 175

ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG ACA 576
Ile Thr Ile Val Glu Ser Asn Ala Thr Ser Thr Ser Thr Ala Gly Thr
180 185 190

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 624
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn
195 200 205

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 672
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr
210 215 220

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 720
Leu Cys Lys Cys Gln Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn
225 230 235 240

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAG TGC CCA AAT GAG TTT ACT 768
Val Pro Met Lys Val Gln Thr Gln Glu Lys Cys Pro Asn Glu Phe Thr
245 250 255

GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AGT ACG TCC 816
Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser
260 265 270



ACT CCC TTT CTG TCT CTG CCT GAA TAGCGCATCT CAGTCGGTGC CGCTTTCTTG 870 
Thr Pro Phe Leu Ser Leu Pro Glu 
275 280 

TTGCCGCATC TCCCCTCAGA TTCCNCCTAG AGCTAGATGC GTTTTACCAG GTCTAACATT 930 

GACTGCCTCT GCCTGTCGCA TGAGAACATT AACACAAGCG ATTGTATGAC TTCCTCTGTC 990 

CGTGACTAGT GGGCTCTGAG CTACTCGTAG GTGCGTAAGG CTCCAGTGTT TCTGAAATTG 1050 

ATCTTGAATT ACTGTGATAC GACATGATAG TCCCTCTCAC CCAGTGCAAT GACAATAAAG 1110 

GCCTTGAAAA GTCAAAAAAA AAAAAAAAAA 1140 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 150: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1764 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150:

G AAG TCA GAA CTT CGC ATT AGC AAA GCG TCA CTG GCT GAT TCT GGA GAA 49
Lys Ser Glu Leu Arg Ile Ser Lys Ala Ser Leu Ala Asp Ser Gly Glu
1 5 10 15

TAT ATG TGC AAA GTG ATC AGC AAA CTA GGA AAT GAC AGT GCC TCT GCC 97
Tyr Met Cys Lys Val Ile Ser Lys Leu Gly Asn Asp Ser Ala Ser Ala
20 25 30

AAC ATC ACC ATT GTG GAG TCA AAC GCC ACA TCC ACA TCT ACA GCT GGG 145
Asn Ile Thr Ile Val Glu Ser Asn Ala Thr Ser Thr Ser Thr Ala Gly
35 40 45

ACA AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG
Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val
50 55 60

AAT GGA GGC GAC TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA 241
Asn Gly Gly Asp Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg
65 70 75 80

TAC TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG 289
Tyr Leu Cys Lys Cys Gln Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu
85 90 95

AAT GTG CCC ATG AAA GTC CAA ACC CAA GAA AAA GCG GAG GAG CTC TAC 337
Asn Val Pro Met Lys Val Gln Thr Gln Glu Lys Ala Glu Glu Leu Tyr
100 105 110

CAG AAG AGA GTG CTC ACC ATT ACC GGC ATT TGC ATC GCG CTG CTC GTG 385
Gln Lys Arg Val Leu Thr Ile Thr Gly Ile Cys Ile Ala Leu Leu Val
115 120 125



GTT GGC ATC ATG TGT GTG GTG GTC TAC TGC AAA ACC AAG AAA CAA CGG 433 
Val Gly Ile Met Cys Val Val Val Tyr Cys Lys Thr Lys Lys Gln Arg
130 135 140

AAA AAG CTT CAT GAC CGG CTT CGG CAG AGC CTT CGG TCT GAA AGA AAC 481
Lys Lys Leu His Asp Arg Leu Arg Gln Ser Leu Arg Ser Glu Arg Asn
145 150 155 160

ACC ATG ATG AAC GTA GCC AAC GGG CCC CAC CAC CCC AAT CCG CCC CCC 529
Thr Met Met Asn Val Ala Asn Gly Pro His His Pro Asn Pro Pro Pro
165 170 175

GAG AAC GTG CAG CTG GTG AAT CAA TAC GTA TCT AAA AAT GTC ATC TCT 577
Glu Asn Val Gln Leu Val Asn Gln Tyr Val Ser Lys Asn Val Ile Ser
180 185 190

AGC GAG CAT ATT GTT GAG AGA GAG GCG GAG AGC TCT TTT TCC ACC AGT 625
Ser Glu His Ile Val Glu Arg Glu Ala Glu Ser Ser Phe Ser Thr Ser
195 200 205

CAC TAC ACT TCG ACA GCT CAT CAT TCC ACT ACT GTC ACT CAG ACT CCC 673
His Tyr Thr Ser Thr Ala His His Ser Thr Thr Val Thr Gln Thr Pro
210 215 220

AGT CAC AGC TGG AGC AAT GGA CAC ACT GAA AGC ATC ATT TCG GAA AGC 721
Ser His Ser Trp Ser Asn Gly His Thr Glu Ser Ile Ile Ser Glu Ser
225 230 235 240

CAC TCT GTC ATC GTG ATG TCA TCC GTA GAA AAC AGT AGG CAC AGC AGC 769
His Ser Val Ile Val Met Ser Ser Val Glu Asn Ser Arg His Ser Ser
245 250 255

CCG ACT GGG GGC CCG AGA GGA CGT CTC AAT GGC TTG GGA GGC CCT CGT 817
Pro Thr Gly Gly Pro Arg Gly Arg Leu Asn Gly Leu Gly Gly Pro Arg
260 265 270

GAA TGT AAC AGC TTC CTC AGG CAT GCC AGA GAA ACC CCT GAC TCC TAC 865
Glu Cys Asn Ser Phe Leu Arg His Ala Arg Glu Thr Pro Asp Ser Tyr
275 280 285

CGA GAC TCT CCT CAT AGT GAA AGA CAT AAC CTT ATA GCT GAG CTA AGG 913
Arg Asp Ser Pro His Ser Glu Arg His Asn Leu Ile Ala Glu Leu Arg
290 295 300

AGA AAC AAG GCC CAC AGA TCC AAA TGC ATG CAG ATC CAG CTT TCC GGA 961
Arg Asn Lys Ala His Arg Ser Lys Cys Met Gln Ile Gln Leu Ser Ala
305 310 315 320

ACT CAT CTT AGA GCT TCT TCC ATT CCC CAT TGG GCT TCA TTC TCT AAG 1009
Thr His Leu Arg Ala Ser Ser Ile Pro His Trp Ala Ser Phe Ser Lys
325 330 335

ACC CCT TGG CCT TTA GGA AGG TAT GTA TCA GCA ATG ACC ACC CCG GCT 1057
Thr Pro Trp Pro Leu Gly Arg Tyr Val Ser Ala Met Thr Thr Pro Ala
340 345 350

CGT ATG TCA CCT GTA GAT TTC CAC ACG CCA AGC TCC CCC AAG TCA CCC 1105
Arg Met Ser Pro Val Asp Phe His Thr Pro Ser Ser Pro Lys Ser Pro
355 360 365

CCT TCG GAA ATG TCC CCG CCC GTG TCC AGC ACG ACG GTC TCC ATG CCC 1153
Pro Ser Glu Met Ser Pro Pro Val Ser Ser Thr Thr Val Ser Met Pro
370 375 380

TCC ATG GCG GTC AGT CCC TTC GTG GAA GAG GAG AGA CCC CTG CTC CTT 1201
Ser Met Ala Val Ser Pro Phe Val Glu Glu Glu Arg Pro Leu Leu Leu
385 390 395 400

GTG ACG CCA CCA CGG CTG CGG GAG AAG TAT GAC CAC CAC GCC CAG CAA 1249
Val Thr Pro Pro Arg Leu Arg Glu Lys Tyr Asp His His Ala Gln Gln
405 410 415

TTC AAC TCG TTC CAC TGC AAC CCC GCG CAT GAG AGC AAC AGC CTG CCC 1297
Phe Asn Ser Phe His Cys Asn Pro Ala His Glu Ser Asn Ser Leu Pro
420 425 430

CCC AGC CCC TTG AGG ATA GTG GAG GAT GAG GAA TAT GAA ACG ACC CAG 1345
Pro Ser Pro Leu Arg Ile Val Glu Asp Glu Glu Tyr Glu Thr Thr Gln
435 440 445

GAG TAC GAA CCA GCT CAA GAG CCG GTT AAG AAA CTC ACC AAC AGC AGC 1393
Glu Tyr Glu Pro Ala Gln Glu Pro Val Lys Lys Leu Thr Asn Ser Ser
450 455 460

CGG CGG GCC AAA AGA ACC AAG CCC AAT GGT CAC ATT GCC CAC AGG TTG 1441
Arg Arg Ala Lys Arg Thr Lys Pro Asn Gly His Ile Ala His Arg Leu
465 470 475 480

GAA ATG GAC AAC AAC ACA GGC GCT GAC AGC AGT AAC TCA GAG AGC GAA 1489
Glu Met Asp Asn Asn Thr Gly Ala Asp Ser Ser Asn Ser Glu Ser Glu
485 490 495

ACA GAG GAT GAA AGA GTA GGA GAA GAT ACG CCT TTC CTG GCC ATA CAG 1537
Thr Glu Asp Glu Arg Val Gly Glu Asp Thr Pro Phe Leu Ala Ile Gln
500 505 510

AAC CCC CTG GCA GCC AGT CTC GAG GCG GCC CCT GCC TTC CGC CTG GTC 1585
Asn Pro Leu Ala Ala Ser Leu Glu Ala Ala Pro Ala Phe Arg Leu Val
515 520 525

GAC AGC AGG ACT AAC CCA ACA GGC GGC TTC TCT CCG CAG GAA GAA TTG 1633
Asp Ser Arg Thr Asn Pro Thr Gly Gly Phe Ser Pro Gln Glu Glu Leu
530 535 540

CAG GCC AGG CTC TCC GGT GTA ATC GCT AAC CAA GAC CCT ATC GCT GTC 1681
Gln Ala Arg Leu Ser Gly Val Ile Ala Asn Gln Asp Pro Ile Ala Val
545 550 555 560 

TAAAACCGAA ATACACCCAT AGATTCACCT GTAAAACTTT ATTTTATATA ATAAAGTATT 1741 
CCACCTTAAA TTAAACAAAA AAA 1764 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 151: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
1 5 10 15

Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys
20 25 30

Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser
35 40 45

Phe Tyr
50

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 152:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152:

Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys
1 5 10 15

Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu Cys Lys Cys
20 25 30

Gln Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn Val Pro Met Lys
35 40 45

Val Gln
50

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 153:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153:

Glu Cys Leu Arg Lys Tyr Lys Asp Phe Cys Ile His Gly Glu Cys Lys
1 5 10 15

Tyr Val Lys Glu Leu Arg Ala Pro Ser Cys Lys Cys Gln Gln Glu Tyr
20 25 30

Phe Gly Glu Arg Cys Gly Glu Lys Ser Asn Lys Thr His Ser
35 40 45



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 154:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 198

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154:

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn
1 5 10 15

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr
20 25 30

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC
Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr
35 40 45

GTA ATG GCC AGC TTC TAC AGT ACG TCC ACT CCC TTT CTG TCT CTG CCT
Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro Phe Leu Ser Leu Pro
50 55 60

GAA TAG
Glu
65

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 155:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 192 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn
1 5 10 15

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr
20 25 30

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 144
Leu Cys Lys Cys Gln Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn
35 40 45

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAA GCG GAG GAG CTC TAC TAA 192
Val Pro Met Lys Val Gln Thr Gln Glu Lys Ala Glu Glu Leu Tyr
50 55 60



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 156:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 183

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn
1 5 10 15

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr
20 25 30

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC 144
Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr
35 40 45

GTA ATG GCC AGC TTC TAC AAA GCG GAG GAG CTC TAC TAA 183
Val Met Ala Ser Phe Tyr Lys Ala Glu Glu Leu Tyr
50 55 60

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 157:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 



AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn
1 5 10 15

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr
20 25 30

TTG TGC AAG TGC CCA AAT GAG TTT ACT GGT GAT CGC TGC CAA AAC TAC 144
Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp Arg Cys Gln Asn Tyr
35 40 45

GTA ATG GCC AGC TTC TAC AAG CAT CTT GGG ATT GAA TTT ATG GAG AAA 192
Val Met Ala Ser Phe Tyr Lys His Leu Gly Ile Glu Phe Met Glu Lys
50 55 60

GCG GAG GAG CTC TAC TAA 210
Ala Glu
65



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 158:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 267

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158:

AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn
1 5 10 15

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr
20 25 30

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 144
Leu Cys Lys Cys Gln Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn
35 40 45

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAG TGC CCA AAT GAG TTT ACT 192
Val Pro Met Lys Val Gln Thr Gln Glu Lys Cys Pro Asn Glu Phe Thr
50 55 60

GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AGT ACG TCC 240
Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser
65 70 75 80

ACT CCC TTT CTG TCT CTG CCT GAA TAG 267
Thr Pro Phe Leu Ser Leu Pro Glu
85



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 159: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 



AGC CAT CTT GTC AAG TGT GCA GAG AAG GAG AAA ACT TTC TGT GTG AAT 48
Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys Thr Phe Cys Val Asn
1 5 10 15

GGA GGC GAG TGC TTC ATG GTG AAA GAC CTT TCA AAT CCC TCA AGA TAC 96
Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser Asn Pro Ser Arg Tyr
20 25 30

TTG TGC AAG TGC CAA CCT GGA TTC ACT GGA GCG AGA TGT ACT GAG AAT 144
Leu Cys Lys Cys Gln Pro Gly Phe Thr Gly Ala Arg Cys Thr Glu Asn
35 40 45

GTG CCC ATG AAA GTC CAA ACC CAA GAA AAG TGC CCA AAT GAG TTT ACT 192
Val Pro Met Lys Val Gln Thr Gln Glu Lys Cys Pro Asn Glu Phe Thr
50 55 60

GGT GAT CGC TGC CAA AAC TAC GTA ATG GCC AGC TTC TAC AAA GCG GAG 240
Gly Asp Arg Cys Gln Asn Tyr Val Met Ala Ser Phe Tyr Lys Ala Glu
65 70 75 80

GAG CTC TAC TAA 252
Glu Leu Tyr



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 160: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160:



CC ACA TCC ACA TCT ACA GCT GGG ACA AGC CAT CTT GTC AAG TGT GCA 47 
Thr Ser Thr Ser Thr Ala Gly Thr Ser His Leu Val Lys Cys Ala 
1 5 10 15 

GAG AAG GAG AAA ACT TTC TGT GTG AAT GGA GGC GAG TGC TTC ATG GTG 95 
Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val 

20 25 30 

AAA GAC CTT TCA AAT CCC TCA AGA TAC TTG TGC 128
Lys Asp Leu Ser Asn Pro Ser Arg Tyr Leu 
35 40 

(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 161:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 141

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161:

Th CAT AAC CTT ATA GCT GAG CTA AGG AGA AAC AAG GCC CAC AGA TCC 46 
His Asn Leu Ile Ala Glu Leu Arg Arg Asn Lys Ala His Arg Ser
1 5 10 15

AAA TGC ATG CAG ATC CAG CTT TCC GCA ACT CAT CTT AGA GCT TCT TCC 94
Lys Cys Met Gln Ile Gln Leu Ser Ala Thr His Leu Arg Ala Ser Ser
20 25 30

ATT CCC CAT TGG GCT TCA TTC TCT AAG ACC CCT TGG CCT TTA GGA AG 141
Ile Pro His Trp Ala Ser Phe Ser Lys Thr Pro Trp Pro Leu Gly Arg
35 40 45 






(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 162: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in positions 15 and 22 is unknown. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162:



Ala Ala Glu Lys Glu Lys Thr Phe Cys Val Asn Gly Gly Glu Xaa Phe 
1 5 10 15 

Met Val Lys Asp Leu Xaa Asn Pro 
20 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 163: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163:



ATG AGA TGG CGA CGC GCC CCG CGC CGC TCC GGG CGT CCC GGC CCC CGG 48
Met Arg Trp Arg Arg Ala Pro Arg Arg Ser Gly Arg Pro Gly Pro Arg
1 5 10 15

GCC CAG CGC CCC GGC TCC GCC GCC CGC TCG TCG CCG CCG CTG CCG CTG 96
Ala Gln Arg Pro Gly Ser Ala Ala Arg Ser Ser Pro Pro Leu Pro Leu
20 25 30

CTG CCA CTA CTG CTG CTG CTG GGG ACC GCG GCC CTG GCG CCG GGG GCG 144
Leu Pro Leu Leu Leu Leu Leu Gly Thr Ala Ala Leu Ala Pro Gly Ala
35 40 45

GCG GCC GGC AAC GAG GCG GCT CCC GCG GGG GCC TCG GTG TGC TAC TCG 192
Ala Ala Gly Asn Glu Ala Ala Pro Ala Gly Ala Ser Val Cys Tyr Ser
50 55 60

TCC CCG CCC AGC GTG GGA TCG GTG CAG GAG CTA GCT CAG CGC GCC GCG 240
Ser Pro Pro Ser Val Gly Ser Val Gln Glu Leu Ala Gln Arg Ala Ala
65 70 75 80

GTG GTG ATC GAG GGA AAG GTG CAC CCG CAG CGG CGG CAG CAG GGG GCA 288
Val Val Ile Glu Gly Lys Val His Pro Gln Arg Arg Gln Gln Gly Ala
85 90 95

CTC GAC AGG AAG GCG GCG GCG GCG GCG GGC GAG GCA GGG GCG TGG GGC 336
Leu Asp Arg Lys Ala Ala Ala Ala Ala Gly Glu Ala Gly Ala Trp Gly
100 105 110

GGC GAT CGC GAG CCG CCA GCC GCG GGC CCA CGG GCG CTG GGG CCG CCC 384
Gly Asp Arg Glu Pro Pro Ala Ala Gly Pro Arg Ala Leu Gly Pro Pro
115 120 125

GCC GAG GAG CCG CTG CTC GCC GCC AAC GGG ACC GTG CCC TCT TGG CCC 432
Ala Glu Glu Pro Leu Leu Ala Ala Asn Gly Thr Val Pro Ser Trp Pro
130 135 140

ACC GCC CCG GTG CCC AGC GCC GGC GAG CCC GGG GAG GAG GCG CCC TAT 480
Thr Ala Pro Val Pro Ser Ala Gly Glu Pro Gly Glu Glu Ala Pro Tyr
145 150 155 160

CTG GTG AAG GTG CAC CAG GTG TGG GCG GTG AAA GCC GGG GGC TTG AAG 528
Leu Val Lys Val His Gln Val Trp Ala Val Lys Ala Gly Gly Leu Lys
165 170 175

AAG GAC TCG CTG CTC ACC GTG CGC CTG GGG ACC TGG GGC CAC CCC GCC 576
Lys Asp Ser Leu Leu Thr Val Arg Leu Gly Thr Trp Gly His Pro Ala
180 185 190

TTC CCC TCC TGC GGG AGG CTC AAG GAG GAC AGC AGG TAC ATC TTC TTC 624
Phe Pro Ser Cys Gly Arg Leu Lys Glu Asp Ser Arg Tyr Ile Phe Phe
195 200 205

ATG GAG CCC GAC GCC AAC AGC ACC AGC CGC GCG CCG GCC GCC TTC CGA 672
Met Glu Pro Asp Ala Asn Ser Thr Ser Arg Ala Pro Ala Ala Phe Arg
210 215 220

GCC TCT TTC CCC CCT CTG GAG ACG GGC CGG AAC CTC AAG AAG GAG GTC 720
Ala Ser Phe Pro Pro Leu Glu Thr Gly Arg Asn Leu Lys Lys Glu Val
225 230 235 240

AGC CGG GTG CTG TGC AAG CGG TGC G 745
Ser Arg Val Leu Cys Lys Arg Cys
245



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 164:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in position 1 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Xaa Ala Leu Ala Ala Ala Gly Tyr Asp Val Glu Lys 
1 5 10 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 165: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
(ix) FEATURE:

(D) OTHER INFORMATION: Xaa in position 1 is unknown.

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:



Xaa Leu Val Leu Arg

1 5



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 166:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(D) OTHER INFORMATION: Xaa in positions 1, 2, and 3 is unknown. 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:

Xaa Xaa Xaa Tyr Pro Gly Gln Ile Thr Ser Asn
1 5 10



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 167:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 
(ix) FEATURE: 

(D) OTHER INFORMATION: N in positions 25 and 36 is unknown. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:
ATAGGGAAGG GCGGGGGAAG GGTCNCCCTC NGCAGGGCCG GGCTTGCCTC TGGAGCCTCT 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 168: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear
(ix) FEATURE:

(D) OTHER INFORMATION: N in position 16 is unknown.
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168:



TTTACACATA TATTCNCC 18 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 169:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 21 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169: 

Glu Thr Gln Pro Asp Pro Gly Gln Ile Leu Lys Lys Val Pro Met Val
1 5 10 15

Ile Gly Ala Tyr Thr
20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 170:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

Met Arg Trp Arg Arg Ala Pro Arg Arg Ser Gly Arg Pro Gly Pro Arg
1 5 10 15

Ala Gln Arg Pro Gly Ser Ala Ala Arg Ser Ser Pro Pro Leu Pro Leu
20 25 30

Leu Pro Leu Leu Leu Leu Leu Gly Thr Ala Ala Leu Ala Pro Gly Ala
35 40 45

Ala Ala Gly Asn Glu Ala Ala Pro Ala Gly Ala Ser Val Cys Tyr Ser
50 55 60

Ser Pro Pro Ser Val Gly Ser Val Gln Glu Leu Ala Gln Arg Ala Ala
65 70 75 80

Val Val Ile Glu Gly Lys Val His Pro Gln Arg Arg Gln Gln Gly Ala
85 90 95

Leu Asp Arg Lys Ala Ala Ala Ala Ala Gly Glu Ala Gly Ala Trp Gly
100 105 110

Gly Asp Arg Glu Pro Pro Ala Ala Gly Pro Arg Ala Leu Gly Pro Pro
115 120 125

Ala Glu Glu Pro Leu Leu Ala Ala Asn Gly Thr Val Pro Ser Trp Pro
130 135 140

Thr Ala Pro Val Pro Ser Ala Gly Glu Pro Gly Glu Glu Ala Pro Tyr
145 150 155 160

Leu Val Lys Val His Gln Val Trp Ala Val Lys Ala Gly Gly Leu Lys
165 170 175

Lys Asp Ser Leu Leu Thr Val Arg Leu Gly Thr Trp Gly His Pro Ala
180 185 190

Phe Pro Ser Cys Gly Arg Leu Lys Glu Asp Ser Arg Tyr Ile Phe Phe
195 200 205

Met Glu Pro Asp Ala Asn Ser Thr Ser Arg Ala Pro Ala Ala Phe Arg
210 215 220

Ala Ser Phe Pro Pro Leu Glu Thr Gly Arg Asn Leu Lys Lys Glu Val
225 230 235 240

Ser Arg Val Leu Cys Lys Arg Cys Ala Leu Pro Pro Gln Leu Lys Glu
245 250 255

Met Lys Ser Gln Glu Ser Ala Ala Gly Ser Lys Leu Val Leu Arg Cys
260 265 270

Glu Thr Ser Ser Glu Tyr Ser Ser Leu Arg Phe Lys Trp Phe Lys Asn
275 280 285

Gly Asn Glu Leu Asn Arg Lys Asn Lys Pro Gln Asn Ile Lys Ile Gln
290 295 300

Lys Lys Pro Gly Lys Ser Glu Leu Arg Ile Asn Lys Ala Ser Leu Ala
305 310 315 320

Asp Ser Gly Glu Tyr Met Cys Lys Val Ile Ser Lys Leu Gly Asn Asp
325 330 335

Ser Ala Ser Ala Asn Ile Thr Ile Val Glu Ser Asn Ala Thr Ser Thr
340 345 350

Ser Thr Thr Gly Thr Ser His Leu Val Lys Cys Ala Glu Lys Glu Lys
355 360 365

Thr Phe Cys Val Asn Gly Gly Glu Cys Phe Met Val Lys Asp Leu Ser
370 375 380

Asn Pro Ser Arg Tyr Leu Cys Lys Cys Pro Asn Glu Phe Thr Gly Asp
385 390 395 400

Arg Cys Gln Asn Tyr Val Met Ala Ser Phe Tyr Ser Thr Ser Thr Pro
405 410 415

Phe Leu Ser Leu Pro Glu
420



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 171: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

Met Ser Glu Arg Lys Glu Gly Arg Gly Lys Gly Lys Gly Lys Lys Lys
1 5 10 15

Glu Arg Gly Ser Gly Lys Lys Pro Glu Ser Ala Ala Gly Ser Gln Ser
20 25 30

Pro Arg Glu Ile Ile Thr Gly Met Pro Ala Ser Thr Glu Gly Ala Tyr
35 40 45

Val Ser Ser Glu Ser Pro Ile Arg Ile Ser Val Ser Thr Glu Gly Ala
50 55 60

Asn Thr Ser Ser Ser
65



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 172: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



Arg Lys Gly Asp Val Pro Gly Pro Arg Val Lys Ser Ser Arg Ser Thr
1 5 10 15

Thr Thr Ala



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 173:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 231

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173:



CGCGAGCGCC TCAGCGCGGC CGCTCGCTCT CCCCCTCGAG GGACAAACTT TTCCCAAACC
CGATCCGAGC CCTTGGACCA AACTCGCCTG CGCCGAGAGC CGTCCGCGTA GAGCGCTCCG
TCTCCGGCGA GATGTCCGAG CGCAAAGAAG GCAGAGGCAA AGGGAAGGGC AAGAAGAAGG
AGCGAGGCTC CGGCAAGAAG CCGGAGTCCG CGGCGGGCAG CCAGAGCCCA G



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 174:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 



CCTTGCCTCC CCGATTGAAA GAGATGAAAA GCCAGGAATC GGCTGCAGGT TCCAAACTAG 
TCCTTCGGTG TGAAACCAGT TCTGAATACT CCTCTCTCAG ATTCAAGTGG TTCAAGAATG 
GGAATGAATT GAATCGAAAA AACAAACCAC AAAATATCAA GATACAAAAA AAGCCAGG 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 175: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:

GAAGTCAGAA CTTCGCATTA ACAAAGCATC ACTGGCTGAT TCTGGAGAGT ATATGTGCAA
AGTGATCAGC AAATTAGGAA ATGACAGTGC CTCTGCCAAT ATCACCATCG TGGAATCAAA
CG



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 176:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 



AGATCATCAC TGGTATGCCA GCCTCAACTG AAGGAGCATA TGTGTCTTCA GAGTCTCCCA
TTAGAATATC AGTATCCACA GAAGGAGCAA ATACTTCTTC AT


(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 177: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 128 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 



CTACATCTAC ATCCACCACT GGGACAAGCC ATCTTGTAAA ATGTGCGGAG AAGGAGAAAA 60 
CTTTCTGTGT GAATGGAGGG GAGTGCTTCA TGGTGAAAGA CCTTTCAAAC CCCTCGAGAT 120 
ACTTGTGC 128



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 178: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 



AAGTGCCAAC CTGGATTCAC TGGAGCAAGA TGTACTGAGA ATGTGCCCAT GAAAGTCCAA 60
AACCAAGAA 69



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 179:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:

TCGGGCTCCA TGAAGAAGAT GTA



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 180:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

TCCATGAAGA AGATGTACCT GCT



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 181:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181:

ATGTACCTGC TGTCCTCCTT GA 22 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 182:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

TTGAAGAAGG ACTCGCTGCT CA 22 



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 183:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:

AAAGCCGGGG GCTTGAAGAA 20



(2) INFORMATION FOR SEQUENCE IDENTIFICATION NUMBER: 184:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

ATGARGTGTG GGCGGCGAAA 20